(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) Worid Intellectual Property OrganizatioD 
International Bureau 




(43) International Publication Date (10) International Publication Number 

12 September 2002 (12.09.2002) pCT WO 02/071690 A2 



(51) IntematioDal Patent Classification^: H04L 12/24 

(21) International Application Number: PCT/CA02/00287 

(22) International Filing Date: 5 March 2002 (05.03.2002) 

(25) FiUng Language: English 

(26) Publication Language: English 



(30) Priority Data: 

60/273,547 



7 March 2001 (07.03.2001) US 



(71) Applicant: MERITON NETWORKS INC. [CA/CA]; 
3026 Solandt Road, Kanata, Ontario K2K 2A5 (CA). 

(72) Inventors: WU, Quansheng; 25 Woodridge Cr., Apt 
1112, Nepean, Ontario K2B 7T4 (CA). LIU, LIng-Zhong; 
1053 Alenmede Cres., Ottawa, Ontario K2B 8H2 (CA). 

(74) Agent: MACGREGOR, George; c/o Maiks & Clerk, P.O. 
Box 957. Station B, Ottawa, Ontario KIP 5S7 (CA). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 
DE, DK, DM, DZ, EE, ES, FT, GB, GD, GE. GH, GM, HR, 
HU, ID, XL, IN, IS, JP, KB, KG. KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX. MZ, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL. TJ. m, 
TR, TT, TZ, UA, UG, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regioml)i ARIFO patent (GH, GM. 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Emasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG, a, CM, GA, GN, GQ, GW, ML, MR, 
NE. SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



= (54) Title: AUTOMATIC CONTROL PLANE RECOVERY FOR AGILE OPTICAL NETWORKS 



< 
o 

o 

o 



fOSPF) 



{ iP Service Provider "[ 



-c 



r 



Lower Layers h 



AA 

Software arcbttectore of antomatic control chaaBCl rccoveiy 



(57) Abstract: This application proposes a solution for pro- 
viding fast auto-recoveiy of the control plane network against a 
control link failure in an optical coimnonications network. The 
described solution applies to both protected and unprotected 
control channels. If a control channel is protected, the solution 
is triggered only when the protection control channel cannot re- 
sume the connectivity. In a control link failure situation each 
node in a neighboring pair attempts to find an alternate control 
route before informing the network manager of the link failure. 
If an alternate route is established, the control plane is quickly 
re-established without involving system resources. 
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AUTOMATIC CONTROL PLANE RECOVERY FOR AGILE OFIICAL NETWORKS 

[0001] This invention claims the benefit of U.S. Provisional Application No. 60/273,547 
filed March 7, 2001. 

5 

Field Of The favention : 

[0002] This invention relates to conununications systems, and more particularly to flie 
automatic recov^ of control signals in the event of a control link failure between 
neighboring nodes in an optical communications system. 

10 

Background 

[0003] Future agile optical networks will need a reliable and robust control networic to 
ensure quality service. These control networks are made up of multiple control channels. 
The control network may be implemented either in-band, in which flie control information 
15 is embedded in the data channel, or out-of-band, in which the control network uses an 
independent control channel separated firom the data channels. Multiple choices exist to 
deploy an out-of-band control plane network. 

[0004] Agile optical networks are expected to quickly and automatically provision lightpath 
20 on the request of customers. Successful provisioning depeads on two basic functions of the 
control network. The first function is routing, which automatically updates the optical 
network topology and related resource information so that a node can compute a route for a 
lightpath for the request The second function is signaling, by which the nodes along a route 
can exchange information to set up or tear down a Ughlpath without user intervention. 
25 • 

[0005] Most of the current control network approaches are based on the extension of the 
existing Memet Protocols (IP). The standard Ihtemet routing protocols, OSPF (Open 
Shortest Path First) and IS-IS (Intermediate System to Intermediate System), are extended 
to exchange optical network routing information and construct the optical routing 
30 information database. These protocols rely on the instant and periodic exchange of the linV 
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State infonnation between a directly connected (physically or logically) pair of nodes, 
called neighbors. These protocols ensure the routing functionality of an optical network. 
The standard signaUng protocol, MPLS (Multi-protocol Label Switching), is extended to 
GMPLS (Generalized MPLS) to support the signaling functionality. This extended protocol 
5 uses the routing databases to set up or tear down a lightpath. The CSMPLS signaling 

protocol assumes that the control plane has the same topology as the data plane regardless 
of whetiier flie control network is in-band or out-of-band. Furthermore, the newly 
developed LMP (Link Management Protocol) needs at least one control channel to be set up 
between two ndglhboring nodes. 

10 

[0006] The OSPF and IS-IS protocols are limited in that their design is based on flie 
assumption that the data and control (routing) information is transmitted by the same 
underlying data link, i.e., in-band control. This means that the health of the data plane 
reflects that of the control plane. In the context of the out-of-band control plane of optical 

15 networks, this assumption is not always true. For example, if a control plane is established 
by an IP network, an intermediate router failure wiU make the control channel inaccessible, 
but the data plane on the optical side may still be functioning. This means that the control 
. plane topology iio longer reflects the optical data plane topology. Even if the OSPF can 
ensure flie accessibility of the control messages by re-routing, the control network topology 

20 is changed because the neighboring relationship has been changed in the control plane. The 
data and control infonnation no longer match each other. 

[0007] The robustness of optical networics depends on the ability to quickly re-establish the 
control plane network and neigihbor relationship when £dlures occur in the control 
25 channels. Current control networks rely on reporting any link failure to the IGP (Interior 
Gateway Protocol) engine. The IGP will flood the network with topology changes 
potentially, reducing the stability of the network. 

30 
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Summary of the Invention 

[0008] The present invention can apply to both in-band and out-of-band control channels. It 
cotild be an in-fiber control plane in which the control information is transported by a 
dedicated wavelength or sub-wavelength in a data channel. It could be an out-of-fiber 
control plane in which the control information is exchanged by a network that does not use 
the fibers connecting the optical nodes. It could be a mixture of in-fiber and out-of-fiber 
connections working together to form a control network. The invention is suitable for all 
cases. The reliability of the control network can be reinforced by deploying redundant 
protection control links between the optical nodes. The robustness of the control network 
relies on the capability of the control network to autonoatically recover from control link 
failures. 

[0009] This application provides a solution for fast auto-recovery of the control plane 
network in a control link failure. This sohition appUes to both protected and unprotected 
control channels. If a control channel is protected, this solution is triggered only when the 
protection control channel cannot resume the connectivity; i.e. when the protection channel 
has failed as well 

[0010] Therefore, in accordance with a first aspect of the present invention, there is 
provided a method of performing automatic recovery of a control plane network in the 
event of a control link failure in an optical communications sj^tem comprising: detecting a 
failure in a control hrk between neighboring switch nodes; searching for an alternate route 
between the neighboring switch nodes; if an alternate route is located, switching the control 
plane to the alternate route» and notifying respective switch nodes of the alternate route. 

[001 1] In accordance with a second aspect of tiie invention, there is provided a system for 
performing automatic recovery of a control plane network in the event of a control link 
failure in an optical communications system comprising: a link manager for detecting a 
failure in a control link between neighboring switch nodes; and a control chaimel manager 
for searching for an alternate route between the neighboring switch nodes, for switching the 

3 
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control plane to the alternate route if an alternate route is located, and for notifying 
respective switch nodes of the alternate route. 

Brief Descriptjon of the Drawings 
5 [0012] The invention will now be described in greater detail with reference to the attached 
drawings wherein; 

[0013] Figure 1 illustrates the software architecture of an automatic control channel 
recovery scheme; and 

10 

[0014] Figure 2 illustrates one embodiment of a control plane network. 
Detailed Description of the Invention 

[0015] The basic underlying principle of the present invention is to maintain the neighbor 
15 relationship when a control channel between a pair of optical nodes goes down or out of 
service. Instead of reporting the failure immediately to the IGP engine, which will, in turn, 
drop the neighbor relationship, the control plane will try to establish an alternate chaimel 
through an alternate route by itself. Once such a channel is set up successfully, the control 
plane switches the failed primary chaimel silently and transparently to the alternate one 
20 without notifying the IGP engine and other rsppec layer applications, such as GMPLS. This 
fast and transparent recovery significantly reduces IGP flooding, thus improving the 
stability of tiie control networics. Furthermore, the alternate control channel can be treated 
as a temporary repair of the control network. Once the fEiilure in the primary channel has 
been repaired, the alternate control channel can be switched back to the primary channel, 
25 without detection by the other control network apphcations. The alternate control channel 
can then be torn down. This switch-back can be triggered manually by an operator, or 
automatically when the primary control channel has been repaired. 
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[00 1 6] This solution applies to all of the possible control network deployments: in-fiber, 
out-of-fiber and a mixture of the two. It also applies to protected control channels, when the 
protection scheme fails to maintain control channel connectivity. 

5 [0017] Figure 1 shows a possible implementation of the proposed solution. 

[0018] The key conq)onents of this implementation are shown in Figure 1 and are described 
in the following discussion. 

10 [0019] The LM (Link Manager) is responsible for managing and monitoring the control 
channels that connect pairs of nodes. The LM interacts with tiie lower layer mechanisms, 
such as LMP (Link Management Protocol), to detect the health of the control channels. 
Once a failure in a control channel has been detected, the LM will report the failure to the 
CCM (Control Ghannel Manager) along with the identifier of the failed control channel. 

15 Once a control channel is re-estabhshed, the CCM notifies the LM that the control channel 
is now back in service. 

[0020] The CCM manages the control channels, and is able to set up or tear down control 
channels. It interacts with the routing engine to maintain knowledge of the control network 
20 topology. It maintains two databases: the Routing Table that holds the initial topology of 
the control network, and the FRT (Forward Redirection Table) that is dynamically updated 
with the IP forwarding interfaces of the local nodes. 

[0021] The FRT is a mapping table of the JP forwarding interfaces of tfie local nodes . It 
25 provides information to the IPF (BP Forwarder) on how and where to redirect flie JP traffic. 

[0022] The IPF forwards the IP packets according to the information &om the Routing 
Table and the FRT. When the IPF receives an IP packet to forward, it consults the routing 
table by the destination IP address, and gets an outgoing forwarding interface. Before 
30 forwarding the packet, the IPF gets the iq>dated outgoing interface from the FRT, then 

5 
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forwards the packet to that interface. A more detailed forwarding procedure description is 
given in the exano^ple to follow. 

[0023] An IPSP (IP Services Provider) offers TP services to the upper layer a^jplications, hi 
5 addition to normal IP services, PSP enables ^Ucations to estafoli^ or tear down an IP 
tunnel, e.g. IP-in-IP tunnel. 

[0024] The OSPF and the Routing Table update the routing and forwarding infonnation. 

10 [0025] Figuie 2 shows as an example of the implementation of this solution in a control 
plane netwoik. In this configuration, three optical switches, nodes A, B and C, are 
connected, by fibers to form a ring. Bi-directional control channels are established 
mirroring the data plane topology (cntl_A-B, cntl_B-C and cntl_C-A). The control channels 
are estabhshed through iii-fiber connections using IP over SONET technology. The IP stack 

15 on the node ensures that the control channel has JP connectivity. An optical extended IGP 
OSPF maintains two topology databases: the CNLSDB (Control Network Link State 
Database), and OLSDB (Optical Data Plane Netsvork Link State Database). In this 
configuration, the Routing Table and the FRT are shown in the tables 1 and 2 respectively. 



Destination 


Outgoing 
Interface 


Node B 


I/F1 


NodeC 


VF2 



20 

Table 1. Routing Table of Node A 



25 
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From Interface 


To Interface 






VF2 


Vf2 



Table 2. Forward Redirection Table of Node A 

5 [0026] When a failure occurs on the control channel between node A and B (e.g. the fiber is 
cut, or the laser is burnt out), the control channel connectivity between node A and B goes 
down. The LM on node A or node B will detect the failure, and report it to its CCM with 
the control channel identifier. Instead of reporting the failure immediately to the OSPF (that 
would instantly trigger flooding the network with updates), each CCM will try to estabhsh 

10 an alternate channel by itself The CCM on the node with the larger node ID (node A), 
looks up the CNLSDB of OSPF, and tries to find a route between node A and B that 
excludes the link between node A and B (because it has failed). In this example, the route 
A-C-B can be found. The CCM of node A then creates an IP-in-IP tunnel through the 
interface I/F2 of node A to the interface VF2 of node B. Once the tunnel is set up 

15 successfully, the CCM of node A will send a message through the tunnel to the CCM of 
node B to request it to set up an IP-in-IP tunnel back to node A. Once the two tunnels are 
set up successfully, the CCMs on both nodes switch the control channel to the P tunnels. 
The CCMs then update the FRTs to map the previous interface (I/Fl) onto the IP tunnel 
inter&ce ( I/F_TunneM). 

20 

[0027] The updated FRT of node A is shown in table 3, Similarly, the CCM on node B 
updates the FRT on node B. The routing tables on both nodes stay unchanged. 
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From Interface 


To Interface 




i/F_TunneL1 


VF2 


I/F2 



Table 3, Updated Forward Redirection Table of Node A 

[0028] The CCMs then notify the corresponding LMs on bofli nodies that the control 
5 channel between A and B has been re-established with the same control chamel identifier. 
The replacement of the control channel is transparent to the LM and to the OSPF. 

[0029] This procedure is based on the assumption that the time to establish the IP tunnel 
and to iq)date FRT would be much shorter than the OSPF's *liello message timeouf ' 

10 (typically 30 seconds). This solution prevents the OSPF from flooding the network with 
topology changes caused by a link failure. As the FRT is built into the IP forwarder, the 
forward redirection is transparent to the upper layer IP applications. It is worth noting that 
the CCM saves the previous control channel infonnation. Whai the failure has been 
repaired, the CCM can switch the control channel back to the previous control channel by 

15 just restoring the FRT. This switch-back can be done automatically by CCM, or manually 
triggered by an operator. After the switch-back is done, the operator can choose to maintain 
the IP tunnel for later use, or tear it down and release the resources. The CCM can be 
configured to perfomi these operations automatically. 

20 [0030] If the CCM cannot establish an alternative IP tunnel between A and B, it will notify 
OSPF of the link failure, which, in turn, will flood it into the network. 

[003 1] As a possible variation to the implementation described above, the TP tunnel can be 
replaced by an LSP (Label Switched Path), using MPLS protocol. In this case, an MPLS 
25 data plane must be implemented on all the nodes. 

8 
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[0032] Tbis solution can be applied directly to control network protection channels for a 
fast and transparent switch-over of an active control channel to a redundant one. The CCM 
keeps the active and redundant control channel information. When a failure occurs on the 
5 active channel, the CCMs of the node-pair update the FRTs to redirect the control traffic 
from the active channel to the back-up one. Again, the switch-back can be easily 
acconfiplished by iq)dating the FRTs appropriately. 

[0033] Although particular embodiments of the invention have been described and 
1 0 illustrated, it will be parent to one skilled in the art that numerous changes can be made 
without departing from the basic concept It is to be understood, however, that such changes 
will fall within the full scope of the invention as defined by the appended claims. 
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We Claim: 

1. A method of perfomiing automatic recovery of a control plane network in the event 
of a control link failure in an optical communications system comprising: 
detecting a failure in a control link between neighboring nodes; 

searching for an alternate route between the neighboring nodes; 

if an alternate route is located, switching the control plane to the alternate route; and 

notifying respective switch nodes of the altemate route. 

2. The method according to claim 1 wherein said control plane network employs 
totemet Protocol (IP) technology, 

3. The me&od according to claim 2 wherein said control plane network is on an in- 
bandlink. 

4. The method according to claim 3 wherein said in-band link is a wavelength channel 
carried on an optical fiber, 

5. The method according to claim 2 wherein said control plane network is on an out- 
of-band link. 

6. The method according to claim 2 wherein said alternate route is an IP tunnel 
between said neighboring nodes. 

7. The method according to claim 1 wherein, if said link failure is repaired, said 
control link is switched back to the original link. 

8. The method according to claim 7 wherein said control link is switched back to said 
original link automatically. 

9. The method according to claim 7 wherein said control link is switched back to said 
original link manually by an operator. 

10. The method according to claim 1 wherein, if an altemate route is not located within 
a preset interval, a search is conducted for an altemate route through the complete 
network. 

1 1 . The method according to claim 1 for use in a protected system wherein the control 
link has a predefined alternate route. 

10 
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12. The method as defined in claim 1 for use in an unprotected system wherein the 
control link does not have a predefined alternate route. 

13. A system for perfoiming automatic recovery of a control plane network in the ev«it 
of a control link failure in an optical communications system comprising: 

a link manager for detecting a failure in a control, link between neighboring nodes; 
and 

a control channel manager for searching for an alternate route between the 
ndgihboring nodes, for switching the control plane to the alternate route if an 
alternate route is located, and for notifying respective nodes of the alternate route. 

14. The system as defined in claim 13 wherein said control channel nianager has an 
information database for maintaining information on the control network. 

15. The system as defined in claim 14 wherein said information database stores a 
forwarding redirection table that maps forwarding interfaces. 

16. The system as defined in claim 13 having an IP forwarder for forwarding 
information firom a routing table and the forwarding redirection table. 
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Figure 1* Software architectare of automatic control chanuel recovery 
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Figure 2. A Control Plane Network 
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(57) Abstract: This ^plication 
proposes a solution for providing 
fast auto-recovery of the 
control plane network against a 
control link failure in an optical 
communications network. The 
described solution applies to both 
protected and unprotected control 
channels. If a control channel 
is protected, the solution is 
triggered only when the protection 
control channel cannot resume 
the connectivity. In a control 
hnk failure situation each node 
in a neighboring pair attempts 
to find an alternate control route 
before infoiming the network 
manager of the link failure. If 
an alternate route is established, 
the control plane is quickly 
re-established without involving 
system resources. 
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